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HIGH-SPEED COMMUNICATION SYSTEM 
WITH A FEEDBACK SYNCHRONIZATION LOOP 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention . 

The present invention relates to a high-speed communication 
system and, more particularly, to a high-speed communication 
system with a feedback synchronization loop. 

2. Description of the Related Art . 

A gigabit communication system is a system that transfers 
billions of bits of data per second between the nodes of the system. 
Gigabit communication systems commonly handle the data 
transferred over the backplane of the internet, and are expected to 
handle the data transferred between next-generation processors and 
peripherals, such as hard drives and printers. 

FIG. 1 shows a block diagram that illustrates a conventional 
gigabit communication system 100. As shown in FIG. 1, system 
100 includes a high-speed transmission medium 108, such as a fiber 
optic cable, and a number of communication devices 1 10 that 
receives data from, and transmits data to, medium 108. 

Each communication device 1 10, in turn, includes a physical 
layer device 112 that is connected to medium 108, and a processing 
device 1 14 that is connected to physical layer device 1 1 2 by a 
number of lines 1 16. Physical layer device 112 includes a 
serializer/deserializer (serdes) that transforms data received from 
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medium 108 into a signal format that is compatible with processing 
device 1 14, and transforms data from processing device 1 14 into a 
signal format that is compatible with medium 108. 

When transferring data to, and receiving data from, 
processing circuit 1 14, the serdes typically utilizes a data signal 
which has a logic high that is represented by a maximum voltage 
which is equal to the supply voltage used by the processing circuit. 

For example, when device 1 14 is formed in a 0.35 micron 
photolithographic process, physical layer device 112 transmits data 
to, and receives data from, device 114 with data signals that have a 
maximum voltage of approximately 3.3V, the supply voltage 
commonly used with 0.35 micron devices. 

One channel of data is typically transported across medium 
108, and between physical layer device 112 and processing device 
1 14, at 1 .25 Gb/s, with speeds of 2.5 Gb/s under consideration. 
Processing device 1 14 processes the data received from medium 
108 by physical layer device 1 12, and outputs processed data to 
physical layer device 1 1 2 for transmission onto medium 108. 

Physical layer device 1 12 and processing device 1 14 are 
typically encapsulated in separate chips which are placed on the 
same printed circuit board due to the largely analog nature of device 
1 12 and the largely digital nature of device 114. One consequence 
of this approach, however, is that electromagnetic interference (EMI) 
requirements limit the maximum speed that data can be exchanged 
between devices 112 and 114. 

For example, when data is exchanged between devices 1 12 
and 1 14 with data signals having a maximum voltage of 
approximately 3.3V, the maximum speed that can be obtained 
without exceeding the EMI requirements is approximately 125 Mb/s. 
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Thus, to handle one channel of inbound data, which is 
received at 1.25 Gb/s, 10 inbound lines 1 1 6 are required to 
transport data from device 112 to device 1 14, where physical layer 
device 112 has 10 output ports and processing device 114 has 10 
input ports. (10 inbound lines 1 16 at 125 Mb/s provide one channel 
of inbound data at 1 .25 Gb/s). 

Similarly, processing device 114 requires 10 outbound lines 
1 16 to transport one channel of outbound data from device 1 14 to 
device 1 12, where processing device 1 14 has 10 output ports and 
physical layer device 112 has 10 input ports. Thus, device 112 and 
1 14 each require 20 input/output ports, with 20 corresponding pins, 
to handle the inbound and outbound data for one channel. 

To provide additional EMI margin and greater chip-to-chip 
spacing, communication devices with reduced chip-to-chip speeds 
are also available. These reduced-speed devices typically transfer 
data between devices 1 1 2 and 1 1 4 at 62.5 Mb/s. 

One problem with communication devices that have reduced 
chip-to-chip speeds, however, is that devices 1 12 and 114 have 
twice as many I/O ports and twice as many pins. Thus, with a 
reduced-speed device, devices 112 and 114 require 40 pins each 
(20 inbound lines 1 1 6 at 62.5 Mb/s are required to provide one input 
channel at 1.25 Gb/s, while 20 outbound lines 116 at 62.5 Mb/s are 
required to provide one outbound channel at 1.25 Gb/s). 

The pin problem becomes even worse when devices 112 and 
1 14 are packaged as four and eight-channel devices. When 
packaged in this way, devices 1 12 and 1 14, when operating at a 
high chip-to-chip speed, i.e., 125 Mb/s, each require 80 pins and 
160 pins to support four and eight-channel devices, respectively. 
Further, devices 1 12 and 1 14, when operating at a slower chip-to- 
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chip speed, i.e., 62.5 Mb/s, each require 160 pins and 320 pins to 
support four and eight-channel devices, respectively. 

The pin problem reaches critical stages when devices 1 12 and 
114 are scaled up to handle a 2.5 Gb/s data rate from the current 
1 .25 Gb/s data rate. At these higher speeds, devices 1 1 2 and 1 1 4, 
when operating at a high chip-to-chip speed, i.e., 125 Mb/s, require 
160 pins and 320 pins to support four and eight-channel devices, 
respectively. Further, devices 1 12 and 1 14, when operating at a 
slower chip-to-chip speed, i.e., 62.5 Mb/s, require 320 pins and 640 
pins to support four and eight-channel devices, respectively. 

Thus, there is a great need to reduce the pin counts of 
devices 1 1 2 and 1 1 4 when devices 1 1 2 and 1 1 4 are scaled up to 
handle a 2.5 Gb/s data rate. (In addition to consuming huge 
amounts of silicon real estate, large pin count devices also consume 
large amounts of power.) 

One conceptual approach to reducing the pin counts is to 
exchange data between devices 112 and 114 with a single-ended 
signal that has a lower maximum voltage. For example, by lowering 
the maximum voltage of a single-ended data signal from 3.3V to 
500mV, the frequency of the data signal can be increased from 125 
Mb/s to approximately 1.25 Gb/s without exceeding the EMI 
requirements. By lowering the maximum voltage from 3.3V to 
250mV, the frequency of the data signal can be increased from 125 
Mb/s to approximately 2.5 Gb/s without exceeding the EMI 
requirements. 

One problem with this conceptual approach, however, is that 
it is extremely difficult, if not impossible, to form inbound detectors 
on processing device 1 14, and outbound detectors on device 112, 
that accurately detect logic ones and logic zeros from a single-ended 
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gigahertz data signal that has a maximum voltage in the hundreds of 
millivolts due to the voltage margins required by the detectors. 

Another problem with this conceptual approach is that much 
more complex clock recovery circuitry is required to recover a clock 
signal from a data signal operating in the gigahertz range, such as 
2.5 GHz, than from a data signal operating in the megahertz range, 
such as 1 25MHz. Thus, much of the clock recovery circuitry that is 
utilized in the serdes would also be required in processing device 
1 14 to recover the clock from a gigahertz data signal (output by 
device 1 12 to device 1 14) that has a maximum voltage in the 
hundreds of millivolts. 

Another approach to reducing the pin count, that also avoids 
this duplication, is to integrate the functions of physical layer device 
112 and processing device 114 on a single chip. One problem with 
this approach, however, is the incompatibility of high-precision 
analog circuits, which make up most of the circuits on physical layer 
device 112, with digital circuits, which make up most of the circuits 
on processing device 114. 

One of these incompatibilities is the speed with which new 
processing technologies can be implemented. For the present, 
digital circuits are easily adapted to new (and smaller) processing 
technologies because the voltage levels that represent logic ones in 
the new processing technologies are still easily distinguished from 
the voltage levels that represent logic zeros. 

For example, in both a 0.5 micron photolithographic process 
and a 0.35 micron photolithographic process, where a logic one is 
represented by a 5V signal and a 3.3V signal, respectively, the logic 
one is easily distinguished from a logic zero which, in both cases, 
has a voltage near zero. 
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For high-precision analog circuits, however, moving from a 
0.5 micron photolithographic process to a 0.35 micron 
photolithographic process, where the supply voltage drops from 5V 
to 3.3V, dramatically reduces, among other things, the dynamic 
ranges of the analog devices. Further design (and time) is then often 
needed to develop devices which operate in these ranges. 

Thus, integrating the functions of physical layer device 112 
and processing device 1 14 on a single chip increases the time 
required for the digital circuitry in the integrated device to take 
advantage of the reduced size and power requirements provided by a 
new photolithographic process. 

As a result, there is a need for a communication device that 
has a physical layer device and a processing device which operate in 
the gigahertz frequency range with substantially fewer pins. 

SUMMARY OF THE INVENTION 

In a communication device having a physical layer device and 
a processing device connected to the physical layer device, the pin 
counts of the physical layer device and the processing device are 
substantially reduced when operating with gigahertz signals by 
utilizing millivolt differential signals. In addition, a calibration 
feedback loop synchronizes the data and clock signals on the 
processing device, thereby eliminating the need for a gigahertz clock 
recovery circuit on the processing device. 

A communications device in accordance with the present 
invention includes a physical layer device that has a media driver 
connectable to a transmission medium, a media receiver connectable 
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to the transmission medium, and a serializer/deserializer (serdes) 
connected to the media driver and the media receiver. The physical 
layer device also includes a master circuit that is connected to the 
serdes. The master circuit has a first physical layer data driver that 
drives a millivolt differential signal, and a first physical layer data 
receiver. 

The communications device further includes a processing 
circuit that has an internal circuit, and a slave circuit connected to 
the internal circuit and the master circuit. The slave circuit has a 
first processing data receiver connected to the first physical layer 
data driver. The first processing data receiver outputs a first signal 
in response to receiving the signal output from the first physical 
layer data driver. In addition, the slave circuit further includes a first 
processing data driver which is connected to the first physical layer 
data receiver, and connectable to the first processing data receiver. 

In addition, the master circuit further includes a clock driver 
which is connected to the serdes and outputs a millivolt differential 
signal, and the slave circuit further includes a clock receiver 
connected to the clock driver. The clock receiver outputs a clock 
signal in response to a signal received from the clock driver. 

Further, the first processing data driver is connectable to 
receive the clock signal from the clock receiver or the first signal 
from the first processing data receiver. The first physical layer data 
receiver receives the clock signal when the first processing data 
driver is connected to receive the clock signal, and receives the first 
signal when the first processing data driver is connected to receive 
the first signal. 

The master circuit additionally includes an aligner that is 
connected to the first physical layer data receiver. The aligner 
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receives the clock signal when the first physical layer data receiver 
receives the clock signal, and the first signal when the first physical 
layer data receiver receives the first signal. The aligner has phase 
comparison circuitry that compares the phase of the clock signal 
received by the aligner with the phase of the first signal received by 
the aligner to determine a phase difference. 

The master circuit further includes a phase delay circuit that is 
connected to the aligner, the serdes, and the first physical layer data 
driver. The aligner passes a plurality of signals to the phase delay 
circuit that indicates the phase difference. The phase delay circuit 
delays the signal output from the first physical layer data driver so 
that the first signal received by the aligner is substantially in phase 
with the clock signal received by the aligner. 

A better understanding of the features and advantages of the 
present invention will be obtained by reference to the following 
detailed description and accompanying drawings which set forth an 
illustrative embodiment in which the principles of the invention are 
utilized. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a block diagram illustrating a conventional gigabit 
communication system 100. 

FIG. 2 is a block diagram illustrating a communication device 
200 in accordance with the present invention. 

FIG. 3 is a graph illustrating the frequency F of the signal 
transmitted between the chips versus the distance D that separates 
the chips in accordance with the present invention. 
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DETAILED DESCRIPTION 

FIG. 2 shows a block diagram that illustrates a communication 
device 200 in accordance with the present invention. As shown in 
FIG. 2, device 200 includes a physical layer device 210 and a 
processing circuit 250 that is connected to device 210. Device 210 
receives information from, and transmits information to, a high- 
speed transmission medium 212, such as a fiber optic cable, while 
circuit 250 receives information from, and transmits information to, 
device 210. 

Device 210, in turn, includes a low-voltage, differential 
receiver 214 that outputs a pair of differential data signals RXR + 
and RXR- in response to receiving a pair of differential data signals 
RX+ and RX- from medium 212, and a low-voltage, differential 
driver 216 that outputs a pair of differential data signals TX+ and 
TX- to medium 212 in response to receiving a pair of differential 
data signals TXT+ and TXT-. 

Data is preferably received from, and transmitted to, medium 
212 at either 1.25 Gb/s, 1.6 Gb/s, 2.5 Gb/s, or 3.125 Gb/s. (The 
present invention is not limited to these frequencies, and may be 
used with other frequencies.) 

Receiver 214 and driver 216 are preferably low-voltage PECL 
optical and short copper receivers and drivers, respectively. (The 
present invention is not limited to these drivers and receivers, and 
may be used with other drivers and receivers.) 

In addition, device 210 also includes a gigabit 
serializer/deserializer (serdes) 220 that transforms the data signals 
RXR + and RXR- into differential input data signals DIN 4- and DIN-, 
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and transforms an output data signal DOUT into the data signals 
TXT+ and TXT-. A switch S switchably connects the input data 
signals DIN+ and DIN- to lines LI and L2 in response to a first logic 
state of a calibration command CMD. 

Serdes 220 also includes a phase-lock-loop circuit 222 that 
outputs a base clock signal (not shown) having, for example, a 
frequency of 250MHz, and a series of phase-delayed clock signals 
PD1-PDn which each have an equal phase delay. For example, each 
of five delayed clock signals can have a 72° phase shift (5*72 = 
360). 

Serdes 220 further includes a multiplier/switch circuit 224 
that multiplies the frequency of the base clock signal to output a pair 
of differential master clock signals MCLK + and MCLK-. The master 
clock signals MCLK+ and MCLK- are preferably output at either 
1.25 Gb/s, 1.6 Gb/s, 2.5 Gb/s, or 3.125 Gb/s. (The present 
invention is not limited to these frequencies, and may be used with 
other frequencies.) 

In addition, circuit 224 also forms a pair of differential slave 
clock signals SLV + and SLV- having the same frequency as the 
master clock signals MCLK+ and MCLK-. Switch S switchably 
connects the slave clock signals SLV+ and SLV- to lines L1 and L2 
in response to a second logic state of the calibration command CMD 
so that the slave clock signals SLV+ and SLV- are output onto lines 
L1 and L2 instead of the data signals DIN+ and DIN-. 

Serdes 220 is preferably implemented to comply with the IEEE 
802.3z Gigabit Ethernet standard. (The present invention is not 
limited to the 802. 3z Standard, and may be used with other 
standards.) 
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Device 210 further includes a master circuit 226 that is 
connected to serdes 220. Master circuit 226, in turn, includes a 
logic circuit 228 that outputs a pair of differential delayable signals 
DD+ and DD- in response to the signals on lines L1 and L2, and a 
series of select signals SEL1-SELs. Thus, when the input data 
signals DIN+ and DIN- are on lines L1 and L2, the delayable signals 
DD+ and DD- represent the input data signals DIN+ and DIN-; and 
when the slave clock signals SLV + and SLV- are on lines L1 and L2, 
the delayable signals DD+ and DD- represent the slave clock signals 
SLV+ and SLV-. 

Logic circuit 228 preferably provides a number of data paths 
that include a reference data path and a number of additional data 
paths that incrementally delay and/or advance the differential signals 
passing through circuit 228. The signals passing through circuit 
228, in turn, follow the data path selected by the select signals 
SEL1-SELs. (The present invention is not limited to circuit 228, and 
may be used with other selectably delayable circuits.) 

In addition, circuit 226 further includes a millivolt differential 
driver 230 that outputs a pair of differential transmit clock signals 
TX CLK+ and TX CLK- in response to the master clock signals 
MCLK + and MCLK-, and a millivolt differential driver 232 that 
outputs a pair of differential transmit signals TX DATA + and TX 
DATA- in response to the delayable signals DD+ and DD-. 

Thus, the transmit clock signals TX CLK+ and TX CLK- 
represent the master clock signals MCLK+ and MCLK-. In addition, 
the transmit signals TX DATA+ and TX DATA- represent the input 
data signals DIN-h and DIN- when the delayable signals DD + and 
DD- represent the input data signals DIN + and DIN-; and the slave 
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clock signals SLV+ and SLV- when the delayable signals DD+ and 
DD- represent the slave clock signals SLV+ and SLV-. 

Drivers 230 and 232 preferably output the transmit clock 
signals TX CLK + and TX CLK- and the transmit signals TX DATA + 
and TX DATA- at either 1.25 Gbs, 1.6 Gbs, 2.5 Gbs, or 3.125 Gbs. 
(The present invention is not limited to these frequencies, and may 
be used with other frequencies.) Thus, when the transmit signals 
TX DATA + and TX DATA- represent the input data signals DIN + 
and DIN-, data is output from device 210 at the same data rate that 
device 210 receives data from medium 212. 

In addition, the transmit clock signals TX CLK + and TX CLK- 
and the transmit signals TX DATA + and TX DATA- are preferably 
defined by the TIA/EIA-644 Low Voltage Differential Signal (LVDS) 
Standard (which defines a 250mV differential signal). (The present 
invention is not limited to the LVDS Standard, and may be used with 
other low-voltage standards.) One of the advantages of the present 
invention is that by using millivolt differential signals, the EMI 
problem encountered by conventional systems is effectively 
eliminated. 

Circuit 226 additionally includes a millivolt differential data 
receiver 234 that outputs a receive signal RD in response to 
receiving a pair of differential transmit signals RX DATA + and RX 
DATA-. 

Further, circuit 226 also includes a phase interpolation and 
aligner circuit 236 that outputs a series of finer delayed signals FD1- 
FDm, each having the same frequency and an equal phase delay, in 
response to each phase delayed signal PD. For example, if phase 
delayed signal PD1 represents a 0° phase shift, phase delayed signal 
PD2 represents a 72° phase shift, and phase delayed signal PD3 
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represents a 144° phase shift, 36 finer delayed signals each having 
a 2° phase shift are formed for signal PD1 , i.e., 0°, 2°, 4°, ... 70°, 
while 36 finer delayed signals each having a 2° phase shift are 
formed for signal PD2, i.e., 72, 74°, 76°, ... 142°. 

The resolution required by the system determines the number 
of finer delayed signals FD1-FDm that are used. For example, to 
have a 1 ° resolution rather than a 2° resolution, 72 finer delayed 
signals each having a 1 ° phase shift are formed for each of the 
phase delayed signals PD1-PDn. 

As described in greater detail below, circuit 236 also utilizes 
the finer delayed signals FD1-FDm to compare the phase of a first 
received signal RD received at a first time and the phase of a second 
received signal RD received at a later time, and outputs the select 
signals SEL1-SELs in response to the comparison such that the 
select signals SELI-SELs define a difference between the phases of 
the signals. 

Circuit 226 further includes a data recovery circuit 240 that 
outputs data signal DOUT and a recovered clock signal RCLK in 
response to the received signal RD and the finer delayed signals 
FD1-FDm. 

As noted above and as further shown in FIG. 2, device 200 
also includes processing circuit 250. Circuit 250, in turn, includes a 
slave circuit 252 and an internal circuit 254 that is connected to 
slave circuit 252. Slave circuit 252 includes a millivolt differential 
receiver 256 that outputs a processing clock signal PCLK in 
response to receiving the clock signals TX CLK+ and TX CLK-, and 
a millivolt differential receiver 258 that outputs a received signal 
RDATA in response to receiving the transmit signals TX DATA + and 
TX DATA-. 
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Thus, processing clock signal PCLK represents the master 
clock signal MCLK+ and MCLK-. In addition, the received signal 
RDATA represents the input data signals DIN+ and DIN- when the 
transmit signals TX DATA + and TX DATA- represent the input data 
signals DIN + and DIN-; and the slave clock signals SLV+ and SLV- 
when the transmit signals TX DATA + and TX DATA- represent the 
slave clock signals SLV+ and SLV-. 

Circuit 252 also includes a first shift register 260 that 
converts the serial received signal RDATA into an input parallel data 
signal DATAIN in response to the processing clock signal PCLK 
clocking the signal. Internal circuit 254 receives the processing 
clock signal PCLK and the parallel data signal DATAIN. 

The processing clock signal PCLK and the received signal 
RDATA have an equivalent path length to the first shift register 260 
{or a known phase delay). In addition, the processing clock signal 
PCLK and the data signals DATAIN are CMOS (or otherwise) 
compatible, and directly interface, with internal circuit 254. 

In addition, circuit 252 further includes a second shift register 
262 that directly interfaces with internal circuit 254 to convert a 
parallel data signal DATAOUT from internal circuit 254 into a serial 
data signal OUTD. Second shift register 262 utilizes the processing 
clock signal PCLK to clock the parallel data signal DATAOUT out of 
shift register 262. Thus, the data signal OUTD is synchronized to 
the processing clock signal PCLK. 

Circuit 252 further includes a first multiplexor 264 that passes 
either the processing clock signal PCLK or the received signal 
RDATA in response to a first mux signal MUX1, and a second 
multiplexor 266 that passes either the processing clock signal PCLK 
or the received signal RDATA, or the data signal OUTD in response 
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to a second mux signal MUX2. Further, the path lengths between 
receiver 256 and multiplexor 264, and between receiver 258 and 
multiplexor 264 are equal (or have a known phase difference). 

In addition, circuit 252 includes a millivolt differential driver 
270 that outputs the differential transmit signals RX DATA+ and RX 
DATA- in response to either the processing clock signal PCLK, the 
received signal RDATA, or the data signal OUTD, depending on 
which signal is passed by multiplexor 266. 

Circuit 252 further includes a logic circuit 272 that receives 
the processing clock signal PCLK from shift register 262, and 
outputs mux signals MUX1 or MUX2 to multiplexors 264 and 266, 
or a fixed pattern, e.g., 1-0-1-0-1-0, to shift register 262 in response 
to commands embedded in the processing clock signal PCLK (by 
serdes 220). 

The transmit signals RX DATA + and RX DATA- are preferably 
output at either 1.25 Gbs, 1.6 Gbs, 2.5 Gbs, or 3.125 Gbs. (The 
present invention is not limited to these frequencies, and may be 
used with other frequencies.) Thus, the transmit signals RX DATA + 
and RX DATA- are output from circuit 250, received by device 210, 
and output by device 210 to medium 212 at the same data rate. 

In addition, the data signals RX DATA + and RX DATA- are 
also preferably defined by the TIA/EIA-644 Low Voltage Differential 
Signal (LVDS) Standard. (The present invention is not limited to the 
LVDS Standard, and may be used with other low-voltage standards.) 

Thus, the received signal RD represents the data signal OUTD 
when the transmit signals RX DATA+ and RX DATA- represent the 
data signal OUTD; the received signal RDATA when the transmit 
signals RX DATA+ and RX DATA- represent the received signal 
RDATA; and the processing clock signal PCLK, which represents the 
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master clock signal MCLK + and MCLK-, when the transmit signals 
RX DATA 4- and RX DATA- represent the processing clock signal 
PCLK. 

In addition, when the received signal RDATA represents the 
slave clock signals SLV + and SLV-, the received signal RD 
represents the slave clock signals SLV+ and SLV-, and when the 
received signal RDATA represents the data signals DIN+ and DIN-, 
the received signal RD represents the signals DIN+ and DIN-. 

Device 210 and circuit 250 are formed in separate packages, 
and may be formed on the same printed circuit board, or may be 
formed on separate circuit boards and connected via a back plane or 
bus. In addition, the frequency of the signal transmitted between 
the packages is a function of the distance that separates the 
packages. 

FIG. 3 shows a graph that illustrates the frequency F of the 
signal transmitted between the chips versus the distance D that 
separates the chips. As shown in FIG. 3, if the chips are separated 
by 7.5cm, the maximum frequency F of the signal is approximately 
2.5 Gb/s; by 45cm, the maximum frequency F of the signal is 
approximately 1.25 Gb/s; and by 5 meters, the maximum frequency 
F of the signal is approximately 622 Mb/s. Thus, one of the 
advantages of the present invention is that devices 1 12 and 114 can 
be further spaced apart, or placed on different circuit boards. 

In operation, device 210 and circuit 250 include a calibration 
mode and a normal operational mode. In the calibration mode, 
serdes 220 outputs the master clock signals MCLK + and MCLK- 
with an embedded command at the frequency of data RX+ and RX-. 

The clock signals MCLK + and MCLK- are driven by driver 230 
as the clock signals TX CLK + and TX CLK-. In response to the 
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clock signals TX CLK-f and TX CLK-, receiver 256 outputs the 
processing clock signal PCLK to shift registers 260 and 262, and 
multiplexor 264. 

The embedded clock command is passed through shift register 
262 to logic circuit 272 which/ in turn, sets the logic states of the 
MUX1 and MUX2 signals to pass the processing clock signal PCLK 
through multiplexors 264 and 266. The processing clock signal 
PCLK is then driven out of circuit 252 by driver 270 so that the 
transmit signals RX DATA + and RX DATA- represent the processing 
clock signal PCLK. 

Receiver 234 outputs the received signal RD, which 
represents processing clock signal PCLK, to aligner 236 in response 
to the transmit signals RX DATA + and RX DATA-. Aligner 236 
compares the phase of the received signal RD with the finer delayed 
signals FD1-FDm of each phase delayed signal PD, and identifies a 
finer delayed signal FD that is closest in phase with the received 
signal RD. Thus, since the received signal RD represents the 
processing clock signal PCLK which, in turn, represents the master 
clock signals MCLK+ and MCLK-, aligner 236 identifies a finer 
delayed signal FD that is closest in phase with the master clock 
signals MCLK + MCLK-. 

In response to either a time out or a signal from aligner 236 
that the phase of the master clock signals MCLK-f MCLK- has been 
identified', serdes 220 changes the command embedded in the clock 
signals MCLK+ and MCLK-, and again outputs the clock signals 
MCLK + and MCLK-. In addition, serdes 220 also switchingly 
connects the slave clock signals SLV+ and SLV- to lines L1 and L2. 

The slave clock signals SLV+ and SLV- are passed through 
the reference data path and output as the delayable signals DD + 
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and DD- which, in turn, are driven by driver 232 as the transmit 
signals TX DATA + and TX DATA-. In response to the transmit 
signals TX DATA + and TX DATA-, receiver 258 outputs the 
received signal RDATA, representing the slave clock signals SLV + 
and SLV-, which is presented to multiplexor 264. 

As before, the clock signals MCLK+ and MCLK- are driven by 
driver 230 as the clock signals TX CLK+ and TX CLK-. In response 
to the clock signals TX CLK+ and TX CLK-, receiver 256 outputs 
the processing clock signal PCLK to shift registers 260 and 262, and 
multiplexor 264. 

The embedded clock command is passed through shift register 
262 to logic circuit 272 which, in turn, sets the logic states of the 
MUX! and MUX2 signals to pass the received signal RDATA through 
multiplexors 264 and 266. The received signal RD is then driven out 
of circuit 252 by driver 270 as the transmit signals RX DATA + and 
RX DATA-. 

Receiver 234 outputs the received signal RD, which 
represents the slave clock signals SLV+ and SLV-, to aligner 236 in 
response to the transmit signals RX DATA + and RX DATA-. Aligner 
236 compares the phase of the received signal RD with the finer 
delayed signals FD1-FDm of each phase delayed signal PD, and 
identifies a finer delayed signal FD that is closest in phase with the 
received signal RD. Thus, since the received signal RD represents 
the slave clock signals SLV+ and SLV-, aligner 236 identifies a finer 
delayed signal FD that is closest in phase with the slave clock 
signals SLV+ and SLV-. 

Aligner 236 compares the phase of the master clock signals 
MCLK + and MCLK- and the slave clock signals SLV + and SLV-, 
and outputs select signals SEL1-SELs which identify a synch data 
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path from the additional data paths through logic circuit 228 such 
that, when the slave clock signals SLV + and SLV- on lines L1 and 
L2 pass through the synch data path, the phase of the master clock 
signals MCLK+ and MCLK- and the phase of the slave clock signals 
SLV + and SLV- are in phase when arriving at aligner 236. (The 
present invention is not limited to aligner 236, and may be used with 
other circuits which determine the difference in phase between the 
slave clock signals SLV+ and SLV- and the master clock signals 
MCLK+ and MCLK- to determine the appropriate select signals 
SEL1-SELs.) 

Since the signal path for the signals that represent the master 
clock signals MCLK + and MCLK- and the slave clock signals SLV + 
and SLV- is the same from multiplexor 264 to aligner 236, the 
signals at the outputs of receivers 256 and 258 are also 
synchronized. Further, the processing clock signal PCLK at the input 
to internal circuit 254 and the parallel data signal DATAIN at the 
input to internal circuit 254 are also synchronized as the slew 
associated with first shift register 260 is negligible. 

As noted above, device 210 and circuit 250 also include a 
normal operational mode. In the normal operational mode, 
multiplier/switch circuit 224 of serdes 220 outputs the clock signals 
MCLK + and MCLK- with an embedded command at the frequency 
of data RX, and switchably connects the data signals DIN+ and 
DIN- to lines L1 and L2. 

As a result, the received signal RDATA, which represents the 
input data signals DIN+ and DIN-, is synchronously provided to first 
shift register 260 with the processing clock signal PCLK. In 
addition, the embedded clock command is passed through shift 
register 262 to logic circuit 272 which, in turn, sets the logic state 
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of the MUX2 signal to pass the data signal OUTD through 
multiplexor 266. 

When working in the gigahertz range, simply providing 
equivalent path lengths for the clock and data signals passing 
through device 210 and circuit 250, which are formed in discrete 
packages, is insufficient to insure that the clock and data signals are 
synchronized because differences between the bonding leads and 
bonding conductions of the discrete packages unacceptably delay 
the clock signal with respect to the data signal, or vice versa. 

The present invention overcomes this problem by providing 
two independent paths (for the clock and data signals) and one 
common feedback path. By synchronizing the signals at the end of 
the common path (at aligner 236), the signals at the beginning of the 
common path (at multiplexor 264) are also synchronized. By 
connecting the common path to the independent paths in the 
package that supports circuit 250, the problems caused by the 
differences between the bonding leads and bonding conductions of 
the discrete packages are eliminated. 

The present invention provides a number of advantages over 
the prior art. In addition to eliminating the EMI problem as noted 
above, the present invention significantly reduces the pin count of 
the packages that hold device 210 and circuit 250. 

In the present invention, device 210 requires six pins for 
transmission and reception with circuit 250, two each for drivers 
230 and 232, and receiver 234. Similarly, circuit 250 requires six 
pins for transmission and reception with device 210, two each for 
receivers 256 and 258, and driver 270. 

This compares with the 20 to 40 pins that are required per 
channel for each of the prior art packages for transmission and 
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reception between device 210 and circuit 250. By substantially 
reducing the pin count, the present invention reduces the amount of 
silicon real estate required to implement device 210 and circuit 250 
which, in turn, reduces the manufacturing costs and increases the 
manufacturing yield. 

Another advantage is that, since device 210 and circuit 250 
are formed in different packages, device 210 and circuit 250 can be 
formed with different processing technologies. Digital circuit 250, 
including the analog elements in slave circuit 252, is easily scaled 
down with advances in processing technology, e.g., 0.25 micron to 
0.18 micron technologies, since there are so few analog devices in 
slave circuit 252. Device 210, which is largely analog and more 
difficult to scale down, can continue to use existing processing 
technologies until newer processes have become proven. 

A further advantage of the present invention is that by 
synchronizing the clock and data signals on processing circuit 252, 
the need for a clock recovery circuit on processing circuit 252 (to 
recover the clock from the data input from the physical layer device) 
is eliminated, thereby saving additional silicon real estate and 
simplifying the design of slave circuit 252. 

In accordance with an alternate embodiment of the present 
invention, a number of channels, such as four or eight, can be 
utilized in lieu of a single channel. In the alternate embodiment, 
each channel includes an inbound path having a line connected to 
serdes 220 and a logic circuit, a driver connected to the logic circuit, 
a receiver connectable to the driver, and a shift register connected to 
the receiver. Each channel further includes an outbound path having 
a shift register, a driver connectable to the shift register, a receiver 
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connectable to the driver and connected to data recovery circuit 
240. 

It should be understood that various alternatives to the 
embodiment of the invention described herein may be employed in 
practicing the invention. Thus, it is intended that the following 
claims define the scope of the invention and that methods and 
structures within the scope of these claims and their equivalents be 
covered thereby. 
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